39-00 



SELECTIVE LABELING AND ISOLATION OF PHOSPHOPEPTIDES AND 
APPLICATIONS TO PROTEOME ANALYSIS 

STATEMENT REGARDING U.S. GOVERNMENT RIGHTS 
This invention was made with U.S. Government support under National Science 
Foundation grant No. BIR92 14821 AM04 and under National Institutes of Health grant no. 
IR33CA84698. The United States government has certain rights in this invention. 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application takes priority from U.S. provisional application serial no. 
60/210,972, filed June 12, 2000, which is incorporated by reference herein in its entirety. 

FIELD OF THE INVENTION 
A general method for selective chemical labeling of phosphate groups is provided 
which facilitates highly specific purification of molecules containing one or several 
phosphate group(s). The method is applicable to the selective labeling of phosphates in 
phosphoproteins and phosphopeptides. When combined with mass spectrometric techniques, 
the method can be employed to detect and identify phosphorylated proteins in complex 
mixtures and to precisely identify the phosphorylated amino acid. The invention has 
application in the field of proteomics where it facilitates the quantitative, global analysis of 
protein phosphorylation in a cell or tissue. 

BACKGROUND OF THE INVENTION 
Proteins are essential for the control and execution of virtually every biological 
process. The rate of synthesis and the half-life of proteins and thus their expression level are 
also controlled post-transcriptionally. Furthermore, the activity of proteins is frequently 
modulated by post-translational modifications, in particular protein phosphorylation, and 
dependent on the association of the protein with other molecules including DNA and 
proteins. Neither the level of expression nor the state of activity of proteins is directly 
apparent from the gene sequence or even from the expression level of the corresponding 
mRNA transcript. A complete description of a biological system must therefor include 
measurements that indicate the identity, quantity and the state of activity of the proteins 
which constitute the system. The large-scale (ultimately global) analysis of proteins 



expressed in a cell or tissue has been termed proteome analysis (Pennington, S. R., Wilkins, 
M. R., Hochstrasser, D. R, and Dunn, M. J. (1997), "Proteome analysis: From protein 
characterization to biological function," Trends Cell Bio. 7:168-173). 

At present no protein analytical technology approaches the throughput and level of 
automation of genomic technology. The most common implementation of proteome analysis 
is based on the separation of complex protein samples, most commonly by two-dimensional 
gel electrophoresis (2DE), and the subsequent sequential identification of the separated 
protein species (Ducret, A. et al. (1998), "High throughput protein characterization by 
automated reverse-phase chromatography/electrospray tandem mass spectrometry," Prot. Sci. 
7:706-719; Garrels, J. I. et al. (1997), "Proteome studies of Saccharomyces cerevisiae: 
identification and characterization of abundant proteins. Electrophoresis," 75:1347-1360; 
Link, A. J. et al. (1997), "Identifying the major proteome components of Haemophilus 
influenzae type-strain NCTC 8143," Electrophoresis 75:1314-1334; Shevchenko, A. et al. 
(1996), "Linking genome and proteome by mass spectrometry: large-scale identification of 
yeast proteins from two dimensional gels," Proc. Natl. Acad. Sci. U.S.A. 93:14440-14445; 
Gygi, S.P. et al. (1999), "Correlation between protein and mRNA abundance in yeast," Mol. 
Cell. Biol. 19:1720-1730; Boucherie, H. et al. (1996), "Two-dimensional gel protein database 
of Saccharomyces cerevisiae," Electrophoresis 17:1683-1699). 

The 2DE approach has been revolutionized by the development of powerful mass 
spectrometric techniques and computer algorithms which correlate protein and peptide mass 
spectral data with sequence databases and, thus, rapidly and conclusively identify proteins 
(Eng, J., McCormack, A., and Yates, J. I. (1994), "An approach to correlate tandem mass 
spectral data of peptides with amino acid sequences in a protein database," J. Am. Soc. Mass 
Spectrom. 5:976-989; Mann, M., and Wilm, M. (1994), "Error-tolerant identification of 
peptides in sequence databases by peptide sequence tags," Anal. Chem. 56:4390-4399; Yates, 
J. R. et al. (1995), "Method to correlate tandem mass spectra of modified peptides to amino 
acid sequences in the protein database," Anal. Chem. 67:1426-1436). 

This technology has reached a level of sensitivity which now permits the 
identification of essentially any protein which is detectable by conventional protein staining 
methods including silver staining (Figeys, D., and Aebersold, R. (1998), "High sensitivity 
analysis of proteins and peptides by capillary electrophoresis tandem mass spectrometry: 
Recent developments in technology and applications," Electrophoresis 79:885-892.; Figeys, 
D. et al. (1998), "Electrophoresis combined with mass spectrometry techniques: Powerful 



tools for the analysis of proteins and proteomes," Electrophoresis 19:181 1-1818; Figeys, D. et 
al. (1997), "A micro fabricated device for rapid protein identification by microelectrospray ion 
trap mass spectrometry,' 1 Anal. Chem. 69:3153-3160; Figeys, D. et al. (1996), "Protein 
identification by solid phase microextraction-capillary zone electrophoresis- 
5 microelectrospray-tandem mass spectrometry," Nature Biotech. 74:1579-1583; Shevchenko, 
A. et al. (1996), "Mass spectrometric sequencing of proteins silver-stained polyacrylamide 
gels," Anal. Chem. 65:850-858). However, the sequential manner in which samples are 
processed limits the sample throughput. The most sensitive methods have been difficult to 
automate and low abundance proteins, such as regulatory proteins, escape detection without 
10 prior enrichment, thus effectively limiting the dynamic range of the technique. In the 2DE- 
based approach proteins are quantified by densitometry of stained spots in the 2DE gels. 

m The development of methods and instrumentation for automated, data-dependent 

electro spray ionization (ESI) tandem mass spectrometry (MS n ) in conjunction with 
gjj microcapillary liquid chromatography (^iLC) and database searching has significantly 

increased the sensitivity and speed of the identification of gel-separated proteins. As an 
i2 alternative to the 2DE/MS" approach to proteome analysis, the direct analysis by tandem mass 
W spectrometry of peptide mixtures generated by the digestion of complex protein mixtures has 
f=i been proposed (Dongr'e, A. R. et al. (1997), "Emerging tandem-mass-spectrometry 
O 1 techniques for the rapid identification of proteins," Trends Biotechnol. 75:418-425). jiLC- * 
2ft^ Ms/MS has also been used successfully for the large-scale identification of individual 
Q proteins directly from mixtures without gel electrophoretic separation (Link, J. et al. (1999), 
"Direct analysis of large protein complexes using mass spectrometry," Nat. Biotech.l7:676- 
682; Opiteck, G.J. et al. (1997), "Comprehensive on-line LC/LC/MS of proteins," Anal. 
Chem. 69:1518-1524.) 

25 While these approaches dramatically accelerate protein identification, the quantities of 

the analyzed proteins cannot be easily determined due to the observation that mass 
spectrometers are inherently not quantitative devices. Direct mass spectrometric analysis of 
protein mixtures by mass spectrometry can be made quantitative by the application of stable 
isotope dilution theory, whereby two chemically identical analytes (one representing an 

30 internal standard and the sample to be measured) are labeled with stable isotope tags of 

identical chemical composition but different mass. This principle has been implemented in 
quantitative proteome analysis by the development of a class of chemical reagents termed 
isotope coded affinity tags (ICAT). (Gygi, S.P. et al. (1999), "Quantitative analysis of 
complex protein mixtures using isotope-coded affinity tags," Nat. Biotechnol. 17, 994-999.) 
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ICAT reagents and their application to the analysis of complex protein mixtures have been 
shown to substantially alleviate the dynamic range problem encountered by the 2DE/Ms n 
approach. 

Protein phosphorylation is one of the most important regulatory events in cells. T3ie 
state of activity of numerous enzymes and processes and the association of specific proteins 
into functional complexes are frequently controlled by reversible protein phosphorylation 
(Graves, J.D. & Krebs, E.D. (1999), "Protein phosphorylation and signal traduction," 
Pharmacol Ther. 82, 111-121; Koch, C. A. et al. (1991), " SH2 and SH>^omains: elements 
that control interactions of cytoplasmic signaling proteins," Science 252, 668-674; Hunter, T. 
(1994), "1001 protein kinases redux-towards 2000," Semin. CeWBiol 5, 367-376). The 
principle goals of studying protein phosphorylation are the kiCTitification, quantitation and 

etermination of the biological function of phosphorylatjem site(s) in phosphoproteins. Much 
of the difficulty in such studies lies in the fact that many phosphoproteins exist at only at very 
low abundance. Further, proteins are often phospnorylated at a low stoichiometry and at 
multiple sites. Therefore, it is usually difficutfto obtain sufficient amounts of pure 
phosphoprotein for such analyses. All cjKrent methods for the analysis of the 
phosphorylation state of proteins foc**s on one purified phosphoprotein at a time (Verma, R. 
et al. (1997), "Phosphorylation of'Siclp by Gl Cdk required for its degradation and entry into 
S phase," Science 278, 455-60; Watts, J.D. et al. (1994), "Identification by electrospray 
ionization mass spectrometry of the sites of tyrosine phosphorylation induced in activated 
Jurkat T cells on thepfetein tyrosine kinase ZAP-70," J. Biol. Chem. 269, 29520-29529; 
Gingras, A.C. et al (1999), "Regulation of 4E-BP1 phosphorylation: a novel two-step 
mechanism,"/Oe«ey Dev. 13, 1422-1437). Because cellular proteins are coordinately 
phosphorylated to control specific biological processes, the complex mechanisms that 
contrel biological systems by protein phosphorylation are difficult to investigate using 
ctfn-ent technology. 

Because phosphopeptide(s) typically are infrequent and of low abundance in protein 
digests, highly purified or enriched phosphopeptide samples are needed for mass 
spectrometric analysis. The need to selectively enrich for phosphopeptides prior to MS 
analysis is particularly urgent if a protein mixture rather than a single purified phosphoprotein 
is being analyzed. In addition, no MS-based method to quantify protein phosphorylation 
directly is currently available. Quantitative study of protein phosphorylation often involves 
methods such as 32 P radiolabeling (Oda, Y. et al. (1999), "Accurate quantitation of protein 
expression and site-specific phosphorylation," Proc. Natl. Acad. Sci. USA 96:6591-6596). 



Therefore, an MS-based method that allows both the identification of the sites of 
phosphorylation from complex mixtures of proteins and their quantitation will be an 
essential part of proteome analysis. 

Thus, there is a substantial need in the art for a more rapid and general method for the 
analysis of protein phosphorylation, particularly in complex protein mixtures, that does not 
require purification to homogeneity of individual phosphoproteins. The present invention 
provides such a method. 

SUMMARY OF THE INVENTION 
This invention provides a method for selective labeling of phosphate groups in natpr£l 

and synthetic oligomers and polymers in the presence of chemically related groups^tfoi as 

carboxylic acid groups. The method is specifically applicable to biological oligomers and 

polymers, including phosphopeptides, phosphoproteins and phospholipid. Selective labeling 

of phosphate groups in proteins and peptides, for example, facilitates separation, isolation and 

detection of phosphoproteins and phosphopeptides in copa^Iex mixtures of proteins. 

Selective labeling can be employed to selectivelvj^ifroduce phosphate labels at phosphate 

groups in an oligomer or polymer, e.g., in a^feptide or protein. Detection of the presence of 

the label, is used to detect the presepe^of the phosphate group in the oligomer or polymer. 

The method is of particular u£e"ior the detection of phosphoproteins or phosphopeptides. The 

phosphate label can be^colorimetric label, a radiolabel, an isotope label, a fluorescent or 

phosphorescentiat^el, an affinity label or a linker group carrying a reactive group (or latent 

reactive srdup) that allows selective attachment of the oligomer or polymer (protein or 

peptide) to a phosphate label, to an affinity label or to a solid phase material. 

Selective attachment of a phosphate group to its affinity labdpp^elective attachment 
to a solid support allows selective isolation of an oligomer op-ptSlymer (e.g., protein or 
peptide) that carries at least one phosphate group.Jh£presence of an affinity label allows 
capture of the selectively labeled oligomgp<jfpolymer using a capture reagent that 
specifically bonds to the affinityJgSel. The presence of a linker that allows selective covalent 
attachment of the oliep*rt6r or polymer to a solid surface allows physical separation of the 
selectively J^b^Ied oligomer or polymer from non-selectively labeled (non-phosphorylated) 
sg£©i€sin a sample. The method is particularly useful for the selectively labeling of proteins 



and peptides having a phosphate group (phosphoproteins andphespKopeptides) in the 
presence of carboxylic acid groups. The metho^ftKisinvention can be used specifically to 
separate phosphoproteins and pho§phoj5eptides from mixtures of non-phosphorylated proteins 
and/or nonphosphorylateffpeptides and thus to overcome detection problems due to low 
levels^xttitSse species in samples. 

In specific embodiments the method is applied to the separation, detection and 
identification of phosphopeptides and phosphoproteins in one or more samples. The method 
can simply be used to selectively introduce a phosphate label that allows detection of the 
presence of one or more phosphate groups in a peptide or protein. The method can also be 
used to selectively introduce an affinity label at a phosphorylation site in a peptide or a 
protein or selectively bond a phosphoprotein or phosphopeptide to a solid surface. 

When combined with well-known methods of mass spectrometry, the selective 
labeling method of this invention facilitates separation of phosphopeptides from mixtures and 
facilitates their detection by mass spectrometry and facilitating sequencing of the peptides by 
tandem mass spectrometry. Art-known methods can be applied to identify phosphoproteins 
in a sample from the sequences of phosphopeptides detected in a sample. When combined 
with methods for differential isotopic labeling, the method of this invention can be employed 
to quantify relative amounts of phosphopeptides and phosphoproteins in different samples. 
These quantitative methods allow a comparison of phosphorylation state in samples derived 
from different sources (e.g., from different cell-types or different organisms), samples that are 
differentially affected by a stimulus (e.g., administration of a drug or contact with a 
potentially toxic material), by a change in environment (e.g., nutrient level, temperature, 
passage of time) or by a change in condition or cell state (e.g., disease state, malignancy, site- 
directed mutation, gene knockouts) of a cell, tissue or organism from which the sample 
originated. The phosphoproteins identified in such a screen can function as markers for the 
changed state. Phosphopeptides and phosphoproteins from any naturally-occurring 
environment or artificially-controlled environment can be assessed by the methods herein. 
The method can be applied to mixtures of naturally-occurring proteins or peptides, as well as 
to mixtures of proteins or peptides derived from recombinant or synthetic methods. 
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The selective labeling method of this invention comprises the following steps as 
applied to phosphopeptides and phosphoprotein: 

(1) the carboxylic acid groups in proteins or peptides in one or more samples 
containing proteins are permenently selectively protected so that any 
phosphate groups in the proteins or peptides in the sample (s) are unprotected 
(remain as free phosphate groups); 

(2) the free phosphate groups in the proteins or peptides of the sample(s) are then 
selectively reacted with a label (e.g., a phosphate label, a radio label, isotope 
label, or an affinity label) or with a linker carrying a reactive group or a latent 
reactive group that will facilitate selective attachment of phosphoproteins or 
phosphopeptides to a label or to a solid surface; and 

(3 a) selectively labeled proteins or peptides are detected using the presence of the 
label as a measure of the presence of at least one phosphate group; or 

(3b) selective attachment of peptides or proteins to an affinity label or to a solid 
surface to facilitate selective isolation of phosphopeptide and /or 
phosphoproteins separated from proteins and peptides that do not have a 
phosphate group. 

In a preferred embodiment, selective labeling of phosphate groups is accompli^heaby 
initial reaction of the proteins and peptides in one or more samples with a projpetfve group, 
such as an amine, that reacts in the presence of a condensation catalyspflrfth and protects both 
carboxylic acid groups and phosphate groups. Amines react wittfcarboxy lie acid (or the 
corresponding esters) to form carboxyamidate bond^Afmnes react with phosphates or 
phosphate ester groups to form phosphoramki^onds. The labeled phosphoramide bonds in 
the protected proteins and peptides arp^tKen selectively cleaved with a reagent that does not 
cleave carboxyamide bonds^Pttis results in the regeneration of free phosphate groups which 
can be selectively lalj^tecf or linked to a solid surface. In a specific embodiment, an amine, 
such as ethapotemine, can be used for initial protection of all carboxylic acid and all 
phq§ph£te groups. For example, a carbodiimide catalyzed condensation of an amine with the 
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peptide or protein forms amide and phosphoramide bonds. Ethanolamine can be selectively 
cleaved from the phosphate groups of the protein or peptide by treatment witl^'ifnld acid 
conditions (e.g., trifluoroacetic acid (tfa) 5 10-30% by volume in waten^femplifying mild 
acid conditions). Excess protecting agent (e.g., excess amine) caprtfe removed by extensive 
washing of the peptides on a reverse phase column. In sn@tfmc embodiments, the free 
phosphate is reacted with a linker group that carrieg^reactive functional group including a 
latent reactive group (such as a sulfhydryl gf&(xp) that can be used to attach the 
phosphoprotein or phosphopeptide to^olid support or used for selective labeling of 
phosphopeptides and phosphop*dfeins with a phosphate label. 

For example, carbodiirmd€-catalyzed condensation reactions can be used to attach cystamine 
to free phosphate^groups. The disulfide bond in the cystamine can be cleaved to generate a 
reactive sulffiydryl group. (Cystamine is an example of a group that carries a latent reactive 
groppX 

Phosphopeptides or phosphoproteins that have been selectively separated and isolated 
from other proteins or peptides in a sample are cleaved from the affinity label or solid support 
and analyzed by conventional mass spectrometric techniques including tandem mass 
spectrometry to detect, identify by sequencing or quantitate the phosphopeptides or 
phosphoproteins in one or more samples. 

Proteins and/or peptides in different samples can be differentially isotopically labeled 
to facilitate comparison of amounts of the same peptide or protein in different samples. 
Isotopic labeling is typically introduced at the carboxylic acid protective groups (e.g., at 
amine groups such as ethanolamine). 

In this method, amine groups in the proteins or peptides are preferably also treated 
with a protective group selective for reaction with the amine side groups of amino acids (e.g., 
the s-amino group of lysine side chains, and/or peptide a-amino groups). This treatment 
controls crosslinking of amine side chains during sample preparation. If sulfhydryl linker 
groups are employed, it is preferred that the samples be treated with a reagent that reduces 
disulfide bonds. Selectively labeled samples are also optionally treated with hydroxyamine 
to remove tyrosine adducts that may have formed during sample preparation. 
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This invention also provides kits for selective labeling of phosphate groups which 
contain reagents needed to carry out selective labeling and optionally contain positive and 
negative controls for use with the kit reagents. A reagent kit comprises a carboxylic 
acid/phosphate reactive protective group and a label or linker group that is to be selectively 
bonded to the phosphate groups in the phosphoproteins or phosphopeptide. The kit also 
contains any catalysts or condensation agents needed to facilitate the reaction (e.g., 
carbodiimide). In addition, the kit optionally includes a reagent for selective cleavage of 
phosphoramide bonds in the presence of amide bonds (e.g., a dilute acid that can be used to 
generate the mild acidic conditions for selective cleavage). Kits can include reagents in 
premeasured aliquots for assay of a selected number of samples. 

The label may be an affinity label and, if so, the kit preferably contaiptfa capture 
reagent appropriate for use with the affinity label. A kit optionally contains protective groups 
for amines (e.g., t-boc or f-moc), and solid phase materials. TheJ*ii may further contain a set 
of differentially isotopically labeled protective groups, linkpr^ affinity labels, or other labels 
(fluorescent, chromophoric or phosphorescent) to alloyt^quantitative determination of the 
amounts (or relative amounts ) of phosphoprotejns and phosphopeptides in different samples. 
With respect to fluorescent, chromophoric^fadiolabels or other labels, different types of 
labels can be used to label the phospji^tes in different samples. For example, different 
fluorescent labels that are separately detectable and can be individually quantitated (e.g., 
fluorescein amine, rhodaprfne amine) can be used to label different samples and to detect 
relative amounts oOioeled peptides in difference samples. Kits further optionally contain 
instructions fprxarrying out selective labeling, as well as, directions for conducting various 
types opdnalysis that can be used in combination with the kit to detect, identify, or quantitate 
p)K$sphopep tides and phosphoproteins. 

In specific embodiments this invention provides: 

A method for selectively labeling or tagging phosphate groups in one or more natural 
or synthetic peptides or proteins in the presence of one or more carboxylic acid groups by 
reacting the natural or synthetic peptides or proteins with a protective group that reacts to 



protect the phosphate groups therein by forming phosphoramide bonds and to protect the 
carboxylic acid groups therein by forming amide bonds; thereafter treating the protected 
peptide or protein under conditions which selectively substantially cleave the phosphoramide 
bond, without substantially cleaving the amide bond to regenerate free phosphate groups in 
the peptide or protein; and reacting the free phosphate groups in the peptide or protein, in 
which the carboxylic acids groups remain protected, with a label or tag comprising a 
functional group that reacts with a phosphate or with a linker that comprises two or more 
functional groups that function to bond a phosphopeptide or phosphopeptide to a solid 
support. 

Reagents which substantially cleave one bond in the substantial absgoe^^of cleavage 
in the other exhibit at least about a 10:1 ratio of cleavage of oneJ^etfEfto another (measured in 
terms of the rate of reaction or the amount of cleavage-pftxiuct detected) and preferably 
exhibit at least about a 20:1 ratio, and moje^tfeferably, at least about a 100:1 ratio of cleavage 
of one bond to the other. Ofcpurggibr applications to the methods herein, the reagents for 
selective cleavage^b^nds are preferably chosen to cleave one bond without any measurable 
cleavage<yfthe other bond. 

In this method the phosphopeptide or phosphoproteins canJj@^5ovalently attached to a 
solid support material through reaction with a sulfhydrjd-gfoup of the linker and the solid 
support can comprise immobilized iodoaceljd~gfoups for reaction with sulfhydryl groups. In 
this method phosphopeptides^pph^sphoproteins an be separated from a mixture by 
attachment to a solid--Stipport or by binding of the phosphopeptides to a capture reagent via an 
affinrty^Bel. 

A method for detecting one or more phosphopeptides in one or more samples 
containing a mixture of peptides by selectively protecting the carboxylic acid groups of the 
peptides in the one or more samples such that any phosphate groups in the peptides remain 
unprotected; selectively labeling the unprotected phosphate groups in the peptides in the 
sample with a label having a functional group that reacts directly or indirectly with a 
phosphate; detecting the peptides carrying the label to detect the phosphopeptides in the 
sample. The label can be a radiolabel, an isotope label, a fluorescent label, a colorimetric 
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label or an affinity label, the label can also be a reactive label which carries at least one 
reactive group or at least one latent reactive group. A latent reactive group is a group that 
must be activated for reaction, e.g., it can be a group that carries a protective group and which 
becomes reactive on removal of the protective group. 

In this method tandem mass spectrometry can be used to determine the amino acid 
sequence of peptides and the precise position of the phosphorylated amino acid within the 
peptide sequence. Quantitation of the relative amounts of phosphopeptide can be 
accomplished by use of differentially isotopically labeled labels or tags. Tandem mass 
spectrometry can also be used to detect one or more phosphopeptides in a sample and 
determine the relative amounts of one or more phosphopeptides in the two or more samples 
by measuring the relative amounts of differentially isotopically labeled labels or tags present 
in the one or more samples. 

The method also provides kits for selectively labeling phosphopeptides in a mixture 
of peptides by reacting the peptides in one or more samples with a protective group which 
reacts with a carboxylic acid or ester thereof and which also reacts with a phosphate group; 
and employing an acid reagent for selectively regenerating free phosphate groups in the 
peptide by reacting the protected peptides under sufficiently mild acid conditions such that 
the phosphoramide bond is substantially cleaved and the amide bond is substantially not 
cleaved. The kit can further comprise any one or more of the following: a radiolabel, a stable 
isotope label, a fluorescent label, a colorimetric label, an affinity label, a capture reagent with 
a corresponding affinity label, a reactive label, protective groups for amine groups, one or 
more solid supports, an iodoacetylated solid support, one or more enzymes for carrying out a 
protein digest; and reagents for carrying out the various enzymatic or chemical reactions of 
the detection or separation methods herein. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 is a scheme illustrating the chemistry of phosphopeptide/phosphoprotein 
labeling and purification of this invention. 
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Fig. 2 is a flow chart illustrating quantitative comparison of phosphopeptides and 
phosphoproteins in two different samples. 

Fig. 3 illustrates the results of mass spectrometric analysis of differentially labeled 
samples of phosphoangiotensin as described in Example 1 . 

Figs. 4A-D illustrate the results of application of the phosphopeptide isolation 
strategy of this invention to the phosphoprotein p-casein. A tryptic digest of p-casein was 
analyzed by LC-MS/MS both before (Figs. 4A and 4C) and after (Figs. 4B and 4D) 
phosphopeptide isolation according to the procedure of this invention. 10 pmol of starting 
material was used for phosphopeptide isolation. Fig. 4A is an ion chromatogram of 1 pmol of 
P-casein digest before phosphopeptide isolation. The peak at m/z =1031.6 represents the 
doubly charged form of the expected phosphopeptide of the P-casein tryptic digest. Fig. 4B 
is an ion chromatogram of the isolated phosphopeptides of the p-casein tryptic digest. The 
peak at m/z = 1 182.5 represents the doubly charged form of the same tryptic phosphopeptide 
from p-casein, indicated in Fig. 4A, but additionally modified on its seven carboxylate groups 
with ethanolamine. Fig. 4C is the CID spectrum of P-casein digest in Fig. 4B. The peak at 
m/z = 938.0 represents the doubly charged form of the selected parent ion (m/z = 1031 .6) 
minus the H 3 P0 4 group. Fig. 4D is the CID spectrum of isolated phosphopeptides of the p- 
casein digest in Fig. 4C. Again, the peak at m/z = 1 133.6 represents the double charged form 
of the selected parent ion (m/z = 1 182.5) minus H 3 P0 4 , and the y-ion series used for peptide 
identification is indicated. The b-ion series are much less intense and are omitted for clarity. 

Figs. 5A and 5B illustrate phosphopeptide isolation from the Lck tyrosine-myelin 
basic protein (MBP) kinase reaction mixture. Fig. 5 A is an LC-MS ion chromatogram of the 
phosphopeptides isolated from a tryptic digest of protein mixture generated from in vitro 
kinase reaction between Lck and MBP. Fig. 5B is a CID mass spectrum of the most intense 
ion (m/z = 630.1, 2+ ion). This peak was subjected to CID analysis and database searching, 
which identified the peptide as THY*GSLPQK from MBP, with the tyrosine residue being 
phosphorylated. 
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Figs. 6 A-C illustrate the results of phosphopeptide isolation from a yeast cell lysate. 
Fig. 6 A is an LC-MS chromatogram of the phosphopeptides isolated from a tryptic digest of 
whole yeast cell lysate. Fig. 6B is an integrated mass spectrum of ions eluting from the LC 
column with retention times between 24.7 and 26.5 min, as indicated in Fig. 6A Major ion 
peaks that additionally exhibited a loss of 98 Da on CID, indicating that they are 
phosphopeptides, are annotated with an asterisk (*). Fig. 6C is the CID spectrum recorded 
for the peptide peak indicated in Fig. 6B at m/z ± 1032.7. This spectrum was sufficient to 
identify the phosphopeptide as TAGIQIVADDLT* VT*NPAR from enolase. However, the 
exact site of threonine phosphorylation was not unambiguously defined because of the 
difficulty in assigning y5 and y6 ions. Both potential locations for the phosphate have thus 
been indicated (*), although the parent ion mass confirms the peptide as a singly 
phosphorylated species. 
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DETAILED DESCRIPTION OF THE INVENTION , 
This invention provides methods for detecting the presence of phosphpjfeptides and 
phosphoproteins in a mixture, identifying phosphopeptides and phosphoproteins that are 
present in a mixture and determining the relative amounts of phosnjadpeptides and 
phosphoproteins in one or more mixtures. The methods are b^ed on the ability to selectively 
from covalent bonds to phosphate groups of peptides injhe presence of carboxylic acid (or 
ester) groups and amine groups of peptides. The mdfhods are more specifically based on the 
ability to selectively bond a label or linker toXphosphate group in the presence of carboxylic 
acids. In particular, the method relies on'the ability to cleave phosphoramide bonds under 
mild acid conditions which do nojx^feave amide bonds. Thus, selective labeling or linking to 
a phosphate group in the mejhod proceeds by initial conversion of peptide and protein 
carboxylic acid groups^f the C-terminus and amino acid side groups) to amides and the 
conversion of phosphate groups of peptides and proteins to phosphoramidate. Thereafter, the 
phosphoramidate are selectively cleaved without cleaving the amides and the free phosphate 
groups^are reacted with selected labels or linkers to facilitate detection, identification and 
quantitation of phosphopeptides in one or more samples. 

A preferred method for forming amide and phosphoramide bonds is by condensation 
with free amines. Condensation can be accomplished using various condensation catalysts 
known in the art, but the use of carbodiimide is a preferred method. In general any amine can 
be employed but alkanol amines, such as ethanolamine are preferred. The amine can function 
simply as a protecting group to facilitate selective reaction with phosphate groups, it can 
carry a detectable label, e.g., carry a group or moiety that can be detected, such as a 
radiolabel, a fluorescent label or the like, or it can carry a group or moiety that can react (a 
reactive functionality), a group that can be made to react (a latent reactive group) or that can 
form a bond or complex to another species (e.g., an affinity label which will bond or complex 
to a capture reagent). 

Phosphoramide bonds are selectively cleaved in the presence of amide bonds by use 
of mild acid conditions. Both the strength of the acid and the length of time of expose to 
acidic conditions can be controlled to obtain selective cleavage of the phosphoramide bonds. 
A preferred treatment uses dilute trifluoroacetic acid, e.g., 10% or less by volume in water for 
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a selected time up to several hours. Thereafter the free-phosphate group can be reacted with a 
variety of labels or linkers. For example, the presence of a phosphopeptide in a sample can 
be detected by the presence of a selected label that has been selectively attached to a 
phosphate group. Phosphopeptides can be separated from non-phosphorylated peptides by 
selective attachment to a solid support, for example, via a linker group that is selectively 
attached to the phosphate group. Selective attachment of phosphopeptides to a solid support 
can be used to isolate and purify phosphopeptide in one or more samples and facilitate their 
detection and identification by analytical methods, particularly by mass spectrometry. 

In a specific embodiment of the methods herein, a peptide mixture, which may be 
generated from a variety of natural or synthetic sources, is treated to protect amine groups in 
the peptides (N-terminal as well as those in amino acid side groups). Amines are preferably 
protected using t-Boc chemistry as is known in the art. Carboxylic acids and phosphate 
groups of amine-protected peptides are then condensed with free amine, preferably 
ethanolamine (which may be isotopically labeled) to form amides and phosphoramidates, 
respectively. Differential isotopic labeling of different samples of peptides is accomplished 
by treating different samples with differentially labeled amines, e.g., non-deuterated 
ethanolamine can be used for one sample and deuterated ethanolamine can be used for a 
second sample. Other stable isotope encoded amine reagents can be used. 

The peptides which are amine-protected and in which the carboxylic acid gpatfps and 
any phosphate groups have been converted to amides and phosphoramidatesffespectively, are 
then treated under mild acid conditions to selective cleave the pho^pftoramide bonds leaving 
e amide bonds substantially intact. Any phosphopeptide^hfa sample carry free phosphate 
groups and the phosphopeptides in different samplps^e differentially isotopically labeled. 
The phosphopeptides in each sample can tfjsrfoe selected attached to a solid support, via a 
sulfhydryl linker generated by conjierfsation of the phosphate groups with cystamine. The 
attached cystamines are th^j^feduced to generate free sulfhydryl groups which can react with 
iodoacetyl groups on^solid support. The cystamine functions as a latent reactive group 
which is activated for attachment to the solid support by reduction (e.g., with dithiothreitol, 
Dp>dftris[2-carboxlethylphosphine], TCEP). 
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Phosphopeptides in samples are attached to the solid support and after extensjyer 
washing are cleaved off the support (e.g., using trifluoroacetic acid). Preferaj^ljvthe amine 
protective groups are cleaved in this reaction as well. These steps prpvicfe purifies, 
isotopically labeled phosphopeptides which ca be subjectedjp'tandem mass spectrometric 
analysis. CID mass spectra provide the sequence o£arly shosphopeptide present in a sample 
and indicate the presence and location of pljo5f5horylated amino acid residues. Using the 
peptide sequence information obtaipefv database searching can be performed to determine the 
protein source of the phosphdpep tides detected. The relative signal intensities of 
differentially isotapi<5ally labeled phosphopeptides in different samples detected in mass 
scans in tl>@*£ame mass spectrometer allow measurement of the ratio of abundance of the 
labeled phosphopeptides in different samples. 

The methods of this invention have several advantages. Amine groups of the peptides 
are protected and do not form amide bonds with the carboxylic acid groups of other peptides 
minimizing peptide cross-linking and other artifactual reactions. The carboxylic acid groups 
remain protected and thus can provide for differential isotope labeling. 

Covalent linkage of the amine labeling the phosphate group to a solid suppojHs a 
preferred implementation of the method of this invention because it allows stra^ent washing 
of the immobilized phosphopeptides and their specific release by acid treatment. The use of 
cysteine to label phosphate groups is preferred because any peptidp^with residual unblocked 
carboxylic acid groups, whether it contains phosphate or not^^ill be converted into 
sulfhydryl groups and remain attached to the solid suppefft irreversibly. The presence of 
cysteine residues interferes with the above metho^fhowever, cysteine groups can be 
optionally alkylated by iodoacetamide of another known alkylating reagent. Before any 
protein sample is subjected to this method of phosphopeptide isolation, it can be reduced by 
dithiothreitol, and then alkylate$H>y excess of iodoacetamide under denaturing condition. 
The solid phase- based embodiment of the method of this invention therefore serves as an 
efficient way for hieWy specific phosphopeptide purification and the stable isotope tags 
introduced bvpdrboxyl group derivatization serves as the basis for relative phosphopeptide 
quantitation. Any solid material that can be derivatized with functional groups that facilitate 
atteefiment of the phosphopeptides (e.g., via sulfhydryl groups) and that is otherwise 



16 



relatively inert with respect to the peptides, reagents and washing^condkkmT^ method 
can be used. For example, any solidphas^jnatei^ useful in solid phase peptide 

synthesis canJ^e^mpkryHaTTHass beads are a preferred solid phase material. 



With respect to fluorescent, chromophoric, radiolabels or other labels, differenj^pes 
of labels can be used to label the phosphates in difference samples. For exanjpteTdifferent 
fluorescent labels that are separately phosphate and can be individu^lly^measured (e.g., 
fluorescein amine, rhodamine amine) can be used to label dji&rent samples and to detect 
relative amounts of labeled peptides in different sapprles. These labels can also be used to 
separate the peptides by RP-HPLC or CEj>a^illary electrophoresis) and to detect the relative 
amounts of peptides by fluorescenpe'measurement. To implement quantitative measurement 
of relative amounts of peptides in difference samples, it is preferred to calibrate the system to 
account for differp«6es in detection of the different labels. For example, is it preferred when 
using different fluorescent labels to calibrate for differences in quantum yields of different 

Figure 1 illustrates the selective labeling method of this invention. As illustrated^ 
peptides (1) are first reacted with an amine protective group (2), such as tBoc far 
Butoxydicarbonate). A variety of useful amine protective groups are knojyn in the art and 
readily available for application in this method. The protective grcmpi;elected must be 
compatible with other reagents used in the method. Peptides w*fh protected amine groups 

(3) are then treated with a reagent that reacts to protect cja^oxylic acid and phosphate groups 

(4) , such as an amine. An amine group will reactjwfth a carboxylic acid group to form an 
amide (-CO-NH-) bond or with a phosphate^group to form a phosphoramidate (-P0 2 -NH-) 
bond. The fully protected peptide (SVi^treated to selectively cleave the phosphoramide bond 
to remove protection from the oJ*5sphate group. Mild acid conditions (6) are used to 
selectively regenerate thepKosphate group. For example, the protected peptide can be treated 
with trifluoroacetic^eid (tfa) (about 30% or less by volume in water for about 1 hour, 
preferably abpnf20% in water for about 1 hour, and more preferably 10% for 30 min. at room 
temperakfre). Mild acidic conditions include the use of a strong acid, such as tfa, in diluted 
foprfiT Other mild acidic conditions that will achieve the desired chemistry can be determined 
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by routine experimentation which treatirientjs^uied^e-i^ cleavage 
and minimize amidg-cieavage: 



Protected peptides in which the phosphate protective groups have been remoy@cf(7) 
are treated to selectively label the free phosphate group. For example, a linl^pr€ontaining a 
sulfhydryl (8) or other reactive group can be selectively bonded to thp^Iiosphate groups. The 
linker group contains functionality for bonding to the phospljM^group (e.g., an amino group) 
and functionality for linking to a label or to a solid sjjpfSce. Alternatively, at this point in the 

ethod, a label (an affinity label or any phosphate label such as a fluorescent label or 
radiolabel) carrying functionality forharfding to the phosphate group can be directly attached 
to the phosphate. As illustrateiWtT Fig. 1, the functionality on the linker group may be a latent 
functional group whicl^frfust be activated prior to reaction to add a label or to covalent attach 
to a solid surfa©£: In the illustrated case, cystamine carries a latent sulfhydryl functional 
group t^S-S- bond) which is activated by addition of a reducing agent (9, such as TCEP or 
,PTT) that reduces disulfide bonds. 

Figure 1 illustrates covalent linkage of phosphopeptides via the sulfhydryl functional 
group on the linker. The peptides carrying the sulfhydryl group linker (10) are reacted with 
derivatized beads carrying iodoacetyl groups (11). A variety of types of solid materials are 
available for use in this method. Solid phase materials (in the form of beads, surfaces, layers, 
etc.) are derivatize to facilitate linkage. After reaction with phosphopeptide (10), remaining 
reactive groups on the solid can be capped or protected with an appropriate capping reagent 
(e.g., with DTT (12)). Phosphopeptides covalently linked to the solid support (13) can be 
physically separated from other peptides in the samples and any non-specifically attached 
peptide can be removed with washing. 

Figure 1 illustrates a step of treating the bound phosphopeptidgs-willTTiydroxylamine 
(14). More specifically, beads carrying bound peptid^Wjrlire washed and incubated in 1M 
hydroxylamine (14, pH 10.0) for abgiit-2^TfsTto regenerate tyrosine, since tyrosine residues 
can form adducts wjlh-ear5odiimide. This step is optionally, but preferred, when 
car^pdiiiTTicfe^ reagents are employed in the method, to regenerate tyrosines in the bound 
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ydroxylamine for about 30 min. is sufficient to regenerate tyrosine. 

Unbound peptides are removed by sequential washing with 5M sodium chloride, 
acetonitrile and water. The volume of each washing step will minimally consist of about 10 
column bed volumes, but larger volumes can also be employed. The bound phosphopeptides 
are then treated under strong acid conditions (15) to cleave the linker to the solid surface and 
to remove the amine protective group (2). The carboxylic acid groups of the free (i.e., 
released) phosphopeptide (16) are, however, still protected. These protective groups can be 
employed to carry differential isotopic labels that are useful for quantitative peptide analysis 
by mass spectrometry. It may also be desirable to employ conditions that will selectively 
cleave the phosphopeptide from the solid support without cleavage of the amine reactive 
protective group (2).. A variety of protective groups which are cleaved using a variety of 
different conditions are available in the art. One of ordinary skill in the art can select a 
protective group, a linker and cleavage conditions that will allow retention of an amine 
reactive protective group on cleavage of the linker from the solid support material. If the 
amine reactive protective group is retained after cleavage of the peptide from the solid 
surface, it can also be used for differentially isotopic labeling. 

Dried separated phosphopeptides are resuspended in water for LC-MS analysis 
Phosphopeptides selectively labeled and isolated by the method herein are preferably 
analyzed by mass spectrometric techniques. Protective groups and linkers used in this 
method are preferably selected so that any modifications to the peptide that are retained on 
release from the solid phase material do not significantly interfere with mass spectral analysis 
and sequencing of the peptide by tandem mass spectrometric methods. 

U.S. patent application serial number 09/383,062 filed August 25, 1999, and 
corresponding International Patent Application W099/19415, filed August 25, 1999, having 
inventors in common with this application, provide analytical reagents and mass 
spectrometry-based methods for the rapid, and quantitative analysis of proteins or protein 
function in mixtures of proteins. The methods employ reagents designated "affinity-labeled 
protein reactive reagents" that allow for the selective isolation of a peptide from complex 
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mixtures. The reagent contains an affinity label covalently linked through a linker group to a 
protein reactive group that selectively reacts with certain protein functional groups. The 
linker may be differentially isotopically labeled. The reagents and methods can be applied to 
the detection and identification of proteins in complex mixtures of proteins, where the 
peptides isolated by the method are characteristic of the presence of a protein in the mixture. 
Isolated peptides are characterized by mass spectrometric (MS) techniques. In particular, the 
sequence of isolated peptides can be determined using tandem MS (MS n ) techniques, and by 
application of sequence database searching techniques, the protein from which the sequenced 
peptide originates can be identified. The affinity-labeled protein reactive reagents can also 
provide for differential isotopic labeling of the isolated peptides to facilitate quantitative 
determination of the relative amounts of proteins in different samples and provide for 
internal standards to facilitate quantitative determination of the absolute amounts of one or 
more proteins present in a sample. The present invention provides a method for selective 
labeling of phosphopeptides in complex mixtures and their selective isolation that can be 
employed with the methods and applications described in U.S. patent application serial no. 
09/383/062 US and International Patent Application WO 99/19415. These patent 
applications are incorporated by reference herein in their entirety to the extent that they are 
not inconsistent with the disclosures herein, among other things for descriptions of 
differential isotopic labeling, mass spectrometric methods and application of the selective 
labeling methods described. 



The following references relate to the application of mass spectrometric^€chniques to 
protein identification, particularly with respect to proteome analysis: Idgk£r T, Thorsson V, 
Ranish JA, Christmas R, Buhler J, Eng JK, Bumgarner R, Goodlptt^DR, Aebersold R, Hood 
L "Integrated genomic and proteomic analyses of a systepatffically perturbed metabolic 
network." Science. 2001 May 4;292(5518):929-^l<dygi SP, Aebersold R. "Mass 
spectrometry and proteomics." Curr Opu^Cliem Biol. 2000 Oct;4(5):489-94.; Gygi SP, Rist 
B, Aebersold R "Measuring gen^e^cpression by quantitative proteome analysis" Curr Opin 
Biotechnol " 2000 AugOJ^396-401; Goodlett DR, Bruce JE, Anderson GA, Rist B, Pasa- 
Tolic L, Fiehn O^JSrmth RD, Aebersold R."Protein identification with a single accurate mass 
of a cysteipe<containing peptide and constrained database searching."Anal Chem. 2000 Mar 
15^72(6): 1 1 12-8.; and Goodlett DR, Aebersold R, Watts JD."Quantitative in vitro kinase 
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eaction as a guide for phosphoprotein analysis by masssg^ctFefriTeSy."Rapid Commun Mass 
Spectrom. 2000;14(5):344-8; Zhou, H^^Apf200\) Nature Biotechnol. 19:375-378. 
These reference are incprporSfedby reference herein to the extent that they are not 
incojisisteTlfwith the disclosure of this application. 



Peptide mixtures subjected to the method of this invention can be generated from 
natural or synthetic samples and may be the results of chemical, physical or enzymatic 
digestion of protein samples. Proteins can be digested using any enzymatic appropriate 
method, such as trypsin digestion. Peptides in the digest preferably range in size from about 
10 to about 50 amino acids in length and are more preferably sized to facilitate peptide 
sequencing using tandem mass spectrometric methods. Those of ordinary skill in the art can 
select a protein digestion protocol suitable for use in protein sample(s) of interest. 

Amines are preferred reagents for selective labeling of carboxylic acidsiffme 
presence of phosphate groups. Any amine reagent will generally providp^tfie function of this 
protective group. Alkanolamine, such as ethanolamine are preferrcd'itmine reagents. One of 
ordinary skill in the art will appreciate that other reagents caj><5e found which provide a 
similar selective labeling function. One or ordinary skiHin the art can identify and selection 
other reagents for selectively labeling without re^ft to undue experimentation. The 
protective group employed must, howevei>tffso be suitable for use in the reactions of this 
invention. Any art-known methods^rfa reagents suitable for selective protection of 
carboxylic acid groups as despri^ed herein are intended to be encompassed by this invention. 

With respect to amine^tfiat are used to selectively protect carboxylic acid groups, the reaction 
of amines with c#r1x)xylic acids and phosphate groups is preferably done in the presence of a 
coupling ag^nt. Coupling agents that can be used in this reaction include, among others, 
dicyplt&exylcarbodiimide, or 2,3,5,6-tetrafluorophenyl trifluoroacetate. In addition, a 
Coupling catalyst such as 4-dimethylaminopyridine can be employed. 



The selective labeling method of this invention employs a treatment that selectively 
removes protection from phosphate groups, but not carboxylic acid groups. In particular, the 
protected peptides are treated with acidic conditions that cleave phosphoamidate bonds, but 
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not amide bonds (herein mild acidic conditions). Treatment under these mild acidic 
conditions cleaves the phosphoamide bonds between phosphate and ethanolamine without 
deprotecting the amine and carboxylic acid groups of the peptide. For example, tBoc 
protection remains mostly intact, unless prolonged acid treatment is involved. Those of 
ordinary skill in the art will appreciate that other treatment conditions may be found which 
provide for a similar function. One or ordinary skill in the art can identify and select other 
reagents for selectively removal of protecting groups on phosphates without resort to undue 
experimentation. Any such methods and reagents known in the art which achieve the 
selective removal of phosphate protective groups as described herein are intended to be 
encompassed by this invention. 



If desired, selectively labeled peptides can be attached to a solid support by attaching 
a linker group carrying an appropriate functional group for bonding to the support. 
Attachment of phosphopeptides to solid supports is exemplified by attachment via sulfhydryl 
group reaction with iodine. Those of ordinary skill in the art appreciate that functional 
groups other than sulfhydryl and iodine can be used to complete a linkage to a solid support 
material. A variety of methods for making such attachments are known in the art. Any 
method and reagents that achieve the function of selective attachment of the phosphopeptide 
to a solid support are intended to be encompassed by this invention. 



The method of this invention as specifically exemplified emgloys^steps of washing 
peptides on Reverse Phase columns to remove undesire^iaafenals from the peptide sample. 
Those of ordinary skill in the art will apprgciat€mat methods for removing such materials 
other than those specificallyd^serfGed herein are known in the art and can be readily applied 
to the method hepskrtoachieve the desired result. All such art-known methods for washing 
or rgpaer^aTof undesired materials are intended to be encompassed by this invention. 



A strategy for quantitative, comparative analysis of proteinphQspherytSfion in one or 
more samples is illustrated in Figure 2. Peptide^amples*^reprepared from two cell states (1 
and 2). As an example, diffej^ntiall5rTs^opic labeled carboxylic acid/ phosphate group 
amine pr^iteetiv^reagents (one for each sample, e.g., dO- or d4-ethanolamine for each of two 
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samples) are used to differentially isotopically label all carboxylic acid groups and initially 
any phosphates in peptide samples. Carboxyl groups include the C-terminus of a peptide and 
the side chains of glutamic acid and aspartic acid residues (as well as any carboxylic acid 
side groups from rare or non-naturally-occurring amino acids). Ethanolamine groups 
protecting the phosphate groups are selectively removed to generate free phosphate groups. 
Free phosphate groups are then derivatized with a linker that facilitated separation of 
phosphopeptides. For every phosphopeptide ultimately purified using this method, there is at 
least one labeled protective group (e.g., ethanolamine) attached to the C terminus of the 
peptide. Dependent upon its structure, a given phosphopeptide may have more than one 
labeled protective group. 

The mass difference between differentially isotopically labeled peaks will depend on 
the isotopic mass difference among the labels and the charge state of the peptides, which can 
be determined in the mass spectrometer itself based on the natural isotope distribution. Since 
isotopic-related peptides essentially co-elute from a microcapillary high performance liquid 
column chromatography (HPLC) run, as they are analyzed by the mass spectrometer, 
multiplets of peaks due to a given peptide appear for each differentially labeled peptide (e.g., 
a doublet for two samples labeled differentially with dO- and d4-ethanolamine). The relative 
intensity of the peaks in the multiplet (e.g., doublet) of peaks from the same peptide in 
differentially isotopically labeled samples directly yields the relative concentrations of that 
peptide in the different samples. The underlying principle of this quantification method is 
that isotopically related peptides are chemically identical and therefore represent a perfect 
mutual internal standard. The intensities of the signals generated in the mass spectrometer 
from the differentially isotopically labeled peptides from different samples precisely reflect 
relative quantities of the peptide molecules present in those samples, respectively. 

The sequence of a phosphopeptide and the identificationjaf : til^site(s) of 
phosphorylation can be determined by a combnaatkyTToftandem mass spectrometry and 
computer- assisted database seai^-pfograms, such as SEQUEST (Trademark, University of 
Washington^Seattte^A) (McCormack, A.L. et al. (1996) "Direct Analysis and Identification 
pf-Pfoteins in Mixtures by LC/MS/MS and Database Searching at the Low-Femtomole 
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Level", Anal. Chem. 69, 767-776; Eng, J.K. et al. (1994)"An Approach to Correlate Tapdem 
Mass Spectral Data of Peptides with Amino Acid Sequences in a Protein Database/^. Amer. 
Soc. Mass. Spectrom., 5, 976-989; U. S. Patent 5,538,897 (Jul. 23, 1996) YajeC III et al.) In 
the first stage of a tandem mass spectrometer, any given phosphopeptide/fs selected and 
subjected to collision induced dissociation (CID). The spectra of a^sulting fragment ion is 
recorded in the second stage of the mass spectrometry, as a so^ealled CID spectrum. This 
process is repeated with other (ideally all) peptides preseplin the sample. Because the CID 
process usually causes fragmentation at peptide bonds and different amino acids for the most 
part yield peaks of different masses, a CDI speefrum alone often provides enough information 
to determine a peptide sequence. Peptide^equencing and protein identification is facilitated 
by using a sequence searching commrfer program, such as SEQUEST™, which takes all 
known genomic sequence, computes all possible theoretical CID spectra and compares them 
to experimental CID specjra for matches and sequence identification. The mass modification 
to the C-terminus, ghjfamic acid, aspartic acids and any other acidic side groups are known 
and this inform^tfon can be incorporated into the computer analysis. Also mass changes due 
to phosphorylation are also known and can be incorporated into the computer analysis. Data 
can b^searched for any possible phosphorylations to serine, tyrosine, and threonine residues, 
thus allowing the identification of sites of phosphorylation. 

The methods of this invention can employ protective groups which are isotopically 
labeled to generate pairs or sets of reagents that are substantially chemically identical, but 
which are distinguishable by mass. For example a pair of protective group reagents, one of 
which is isotopically heavy and the other of which is isotopically light can be employed for 
the comparison of two samples, one of which may be a reference sample containing one or 
more known proteins in known amounts. For example, any one or more of the hydrogen, 
nitrogen, oxygen or sulfur atoms in the protective group may be replaced with their 
isotopically stable isotopes: 2 H, 13 C, 15 N, I7 0, ls O or 34 S. Differential isotopic labeling is 
preferably introduced into the peptides of this invention in the carboxylic acid protective 
group. 
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The methods of this invention can use affinity labels or phosphate labels for the 
detection or separation of phosphopeptide. The methods of this invention can use any 
phosphate label, including, but not limited to radiolabels, fluorescent labels, colorimetric 
labels, etc. The label is selectively attached to a phosphate in a phosphopeptide in the sample 
and its detection detects the presence of the phosphate. 

Suitable affinity labels bind selectively either covalently or non-covalently and with 
high affinity to a capture reagent (CR). The CR-A interaction or bond should remain intact 
after extensive and multiple washings with a variety of solutions to remove non-specifically 
bound components. The affinity label binds minimally or preferably not at all to components 
in the assay system, except CR, and does not significantly bind to surfaces of reaction 
vessels. Any non-specific interaction of the affinity label with other components or surfaces 
should be disrupted by multiple washes that leave CR-A intact. Further, it must also be 
possible to disrupt the interaction of A and CR to release peptides, substrates or reaction 
products, for example, by addition of a displacing ligand or by changing the temperature or 
solvent conditions. Preferably, neither CR or A react chemically with other components in 
the assay system and both groups should be chemically stable over the time period of an 
assay or experiment. The affinity label is preferably soluble in the sample liquid to be 
analyzed and the CR should remain soluble in the sample liquid even though attached to an 
insoluble resin such as Agarose or controlled pore glass. With respect to CR, the term 
soluble means that CR is sufficiently hydrated or otherwise solvated such that it functions 
properly for binding to A. CR or CR-containing conjugates should not be present in the 
sample to be analyzed, except when added to capture A. Affinity labels useful in this 
invention contain functionality that allows them to be bonded (preferably via a covalent bond 
which can be selectively cleaved, if and when desired) to a phosphate group. 

Examples of A and CR pairs include: 

d-biotin or structurally modified biotin-based reagents, including d-iminobiotin, 
which bind to proteins of the avidin/streptavidin, which may, for example, be used in the 
forms of strepavidin- Agarose, oligomeric-avidin- Agarose, or monomeric-avidin- Agarose; 
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any 1,2-diol, such as 1,2-dihydroxyethane (HO-CH 2 -CH 2 -OH), and other 1,2- 
dihyroxyalkanes including those of cyclic alkanes, e.g., 1,2-dihydroxycyclohexane which 
bind to an alkyl or aryl boronic acid or boronic acid esters , such as phenyl-B(OH) 2 or hexyl- 
B(OEthyl) 2 which may be attached via the alkyl or aryl group to a solid support material, 
such as Agarose; 

maltose which binds to maltose binding protein (as well as any other sugar/sugar 
binding protein pair or more generally to any ligand/ligand binding protein pairs that has 
properties discussed above); 

a hapten, such as dinitrophenyl group, for any antibody where the hapten binds to an 
anti-hapten antibody that recognizes the hapten, for example the dinitrophenyl group will 
bind to an anti-dinitrophenyl-IgG; 

a ligand which binds to a transition metal, for example, an oligomeric histidine will 
bind to Ni(II), the transition metal CR may be used in the form of a resin bound chelated 
transition metal, such as nitrilotriacetic acid-chelated Ni(II) or iminodiacetic acid-chelated 
Ni(II); 

glutathione which binds to glutathione-S-transferase. 

The covalent attachment of A to CR can be accomplished, for example, by the 
reaction of iodoacetamide on CR with a sulfhydryl group on A. 

In general, any A-CR pair commonly used for affinity enrichment which meets the 
suitability criteria discussed above can be employed. Biotin and biotin-based affinity tags are 
preferred. Of particular interest are structurally modified biotins, such as d-iminobiotin, 
which will elute from avidin or strepavidin columns under solvent conditions compatible 
with ESI-MS analysis, such as dilute acids containing 10-20% organic solvent. It is expected 
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that d-iminobiotin tagged compounds will elute in solvents below pH 4. d-Iminobiotin 
tagged protein reactive reagents can be synthesized by methods described herein for the 
corresponding biotin tagged reagents. 

A displacement ligand, DL, is optionally used to displace A from CR. Suitable DLs 
are not typically present in samples unless added. DL should be chemically and 
enzymatically stable in the sample to be analyzed and should not react with or bind to 
components (other than CR) in samples or bind non-specifically to reaction vessel walls. DL 
preferably does not undergo peptide-like fragmentation during MS analysis, and its presence 
in sample should not significantly suppress the ionization of tagged peptide, substrate or 
reaction product conjugates. 

DL itself preferably is minimally ionized during mass spectrometperlinalysis and the 
formation of ions composed of DL clusters is preferably minimaL^Ffie selection of DL, 
depends upon the A and CR groups that are employed. L^ggneral, DL is selected to displace 

from CR in a reasonable time scale, at most wijhiffa week of its addition, but more 
preferably within a few minutes or up to^rfiour. The affinity of DL for CR should be 
comparable or stronger than the affifuty of the tagged compounds containing A for CR. 
Furthermore, DL should)>e^oluble in the solvent used during the elution of tagged 
compounds contajirifig A from CR. DL preferably is free A or a derivative or structural 
modificatjatfof A. Examples of DL include, d-biotin or d-biotin derivatives, particularly 
thps€containing groups that suppress cluster formation or suppress ionization in MS. 

The method of this invention can employ linker groups which bond to phosphate 
groups to attach the phosphopeptide to a solid support. A linker can also be used to attach an 
affinity label or phosphate label to a phosphopeptide. Any linker used should preferably be 
soluble in the sample liquid to be analyzed and it should be stable with respect to chemical 
reaction, e.g., substantially chemically inert, with components of the sample as well as with 
any other reagents used in the method. The linker when bound to the peptide should not 
interfere with the specific interaction of an affinity label with a CR and should bind 
minimally or preferably not at all to other components in the system, to reaction vessel 
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surfaces or CR. Any non-specific interactions of the linker should be broken after multiple 
washes. 

Samples that can be analyzed by methods of this invention include cell homogenates; 
cell fractions; biological fluids including urine, blood, and cerebrospinal fluid; tissue 
homogenates; tears; feces; saliva; lavage fluids such as lung or peritoneal lavages; mixtures 
of biological molecules including proteins, lipids, carbohydrates and nucleic acids generated 
by partial or complete fractionation of cell or tissue homogenates. 

The methods of this invention employ mass spectrometric and tandem mass 
spectrometric methods. While a variety of MS methods are available and may be used in 
these methods, Matrix Assisted Laser Desorption Ionization MS (MALDI/MS) and 
Electrospray Ionization MS (ESI/MS) methods are preferred. 

The method of this invention is illustrated and exemplified by application to the 
detection and identification of phosphopeptides in a phosphopeptide standard sample, 
(Example 2, Fig. 3A-C); an enzymatic digest of a single phosphoprotein, p-casein (Example 
3, Fig. 4A-4D); a protein phosphorylated in vitro at a tyrosine residue (Example 4, Fig. 5A- 
C), a digest of total lysate of yeast cells (Example 5, Figs. 6A-C and Table 1); and a digest of 
total lysate of Jurkat cells (Table 2). 

The following examples are intended to further illustrate the invention and are not 
intended to limit the invention. 
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EXAMPLES / 

Example 1 :Phosphopeptide Isolation Procedure / 

Peptide samples were dried, and then subjected to the method shown in Fig/1 A 
according to the following steps. 1) Peptide mixture was resuspended in 50% (y/v) of 0.1 M 
phosphate buffer (pH 1 l)/acetonitrile. 0.1 M of t-Butyl-dicarbonate (tBoc) was added for 4 
hours at room temperature. 2) Acetonitrile was removed under reduced pressure. Samples 
were made to 1 M ethanolamine, 25 mM N-hydroxysuccinimide (TvfHSVand 0.5 M of N,N'- 
dimethylaminopropyl ethyl carbodiimide ©HC1 (EDC) and incubated/2 hours at room 
temperature. 3) 10% trifluoroacetic acid (TFA) was added for 30 pfrinutes at room 
temperature. Longer treatment under these conditions did not detrimentally effect the results. 
Samples can be neutralized at this point, but neutralization Was found to have no significant 
effect on results. Samples were then desalted on and recovered from a CI 8 column (Waters 
Associates, Milford, MA WAT 023590) using elution/<vith 80% acetonitrile, 0.1% TFA. 4) 
Peptides were dried and redissolved in 1 M imidazole (pH 6.0). Imidazole is optional and 
was employed to inhibit possible carbodiimide aflduct formation with sensitive amino acids 
such as histidine. 0.5 M EDC was added forX hours at room temperature. Samples were 
loaded on a CI 8 column, washed with wa£er and treated with 1 M cystamine (pH 8.0) for 2 
hours at 50 °C on the column. Peptides/were washed with water and reduced with 10 mM 
DTT to generate free sulfhydryl gratips. 5) After washing to remove DTT, peptides were 
eluted with 80% acetonitrile, 0.1 TFA and incubated with 20 mg beads with immobilized 
iodoacetyl groups for at least ^hours at pH 8.0 (titrated with 1 M Tris pH 8.0, 50 mM 
EDTA). Beads with immomlized iodoacetyl groups were prepared by a 2-hour reaction 
between 3 equivalents of Iodoacetic anhydride and 1 equivalent of amino beads (Sigma, 
G4643) with 3.3 equivalent of diisopropylethylamine in dimethylformide. The formation of a 
tyrosine adduct wjtti carbodiimide is a possible side reaction. Such an adduct is unstable 
against nucleopniles such as hydroxylamine. Therefore, after attachment of phosphopeptide 
to the beads/1 M hydroxylamine (pH 10) was used to incubate beads for 2hours at room 
temperature. This restored tyrosine residues. It has been found that treatment with 5% 
hydroxylamine solution for 30 min. is typically sufficient to restore tyrosine residues. Beads 
were then washed sequentially with 2 M NaCl, methanol and water to remove nonspecifically 
bound molecules. 6) The beads were incubated with 100% TFA for 30 minutes to recover 
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phosphopeptides. Concurrently, tBoc protection was removed. The recovered sample was 
dried under reduced pressure and resuspended in water for LC-MS/MS analysis. 

Example 2 

Two separate samples of equal amounts of phosphoangiotensin peptide were analyzed 
by the method of this invention. The carboxylic acid groups in the two differeijWamples 
were blocked (leaving phosphate groups free as described above) by eithpHight ethanolamine 
(dO-ethanolamine) or heavy ethanolamine (d4-ethanolamine, HOCp^CD 2 NH 2 ). 
Phosphoangiotensin contains two carboxylic acid groups, sojfllat the mass difference for the 
[M+2H] 2+ ion is 4 for the differentially labeled peptid^^The results of mass spectrometric 
analysis of the differentially labeled samples thajxc^ere subjected to selective labeling and 
separation of phosphopeptides of this invention is illustrated in Figs. 3A-C. A doublet of 
peaks [M+2H] 2+ at m/z^ 607 and 6p<aue to light and heavy labeled samples, respectively, is 
observed as expected. Furthep<Iie relative ratios of the two peaks is about 1 : 1 as expected. 
The CID spectrum of e^en of these peaks is similar to that of the unprotected peptide, except 
for the fragment iafis that are mass shifted by the modification (label attachment). 
Modificatiof^to the carboxylic acid groups used to achieve differential labeling do not 
advej^ty affect the quality of the CID spectrum that would be used to identify the sequence 
df unknown peptides. 

Phosphopeptide isolation was performed essentially as in Example 1 with minor 
modifications. Peptides were suspended in 50% (v/v) acetonitrile and 0.1 M potassium 
phosphate buffer (pH 1 1). t-Butyloxy-carbonyl (t-Boc, 1M) was added for 4 h at room 
temperature. Acetonitrile was removed under reduced pressure. The amine-protected peptide 
was dried under reduced pressure and resuspended in 1 M ethanolamine HC1. The pH of the 
solution was adjusted to approximately 6 by addition of 50 mM N-hydroxy succinimide 
(NHS). N,N'-demethylaminopropylethyl carbodiimide-HCl (EDC) was added (5 mg/50jaL) 
for 4 h at room temperature. Amine-protected peptide was labeled with the heavy 
ethanolamine (d4-ethanolamine, HOC0 2 CD 2 -NH 2 ) in a similar manner. 



Phosphates were selectively deprotected in the differentially isotopically labeled 
peptides by mixing each solution containing the labeled peptide with an equal volume of 20% 
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(v/v) trifluoroacetic acid for 1 h and thereafter neutralizing the reactions using 2 M sodium 
phosphate buffer. The neutralized solution was loaded on a reverse phase CI 8 column and 
washed extensively with water. The peptide with regenerated phosphate groups was eluted 
from the C-18 column using 80% (v/v) acetonitrile in water and then dried. Samples of 
heavy and light isotopically labeled peptide were combined for mass spectrometric analysis. 

An LCQ ion trap mass spectrometer (Finnigam MAT, San Jose, CA) was used with 
an HP 1 100 solvent delivery system (Agilent, Palo Alto, CA). Peptides were pressure-loaded 
onto the column, then eluted and analyzed by microcapillary LS-MS/MS as described in 
Gygi, S.P. et al. 1999 supra . The collision energy or the LCQ was set at 30%. 



Example 4 : Isolatj 




The methods of this invention were also used to purify and detect phosphopeptides 
from bovine p-casein, a well-characterized phosphoprotein. The peptide was labeled as 
described in Example 2. A tryptic digest of the phosphoprotein was analyzed by 
microcapillary LC-MS/MS. As shown in Fig. 4 A, numerous peptides were observed for the 
untreated p-casein digest. The peptide indicated in Fig. 4A was a doubly charged ion at 
w/z=1031.6. When selected for fragmentation via collision induced dissociation (CID) (Fig. 
4C) (Papayannopoulos, LA. (1995), "The interpretation of collision-induced dissociation 
tandem mass spectra of peptides," Mass Spectrometry Rev. 14,49-73), its fragment ion 
spectrum exhibited mostly the y-ion series typical for low energy peptide fragmentation and 
an additional major signal at w/z=983.0 corresponding to a loss of 98 Da due to the loss of 
the H 3 P0 4 group from the parent ion(Jonscher, K.R. and Yates, J.R. Ill (1997), "Matrix- 
assisted laser desorption ionization/quadrupole ion trap mass spectrometry of peptides. 
Application to the localization of phosphorylation sites on the P protein from Sendai virus," 
J. Biol. Chem. 272, 1735-1741; Qin, J. & Chait, B.T. (1997), "Identification and 
characterization of posttranslational modifications of proteins by MALDI ion trap mass 
spectrometry," Anal Chem. 69, 4002-4009). Database searching of this CID spectrum 
identified a peptide with sequence FQS*EEQQQTEDELQDK (* denotes a phosphate group). 
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The mass difference between the yl3 and yl4 ions corresponded to that of phosphoserine, 
confirming Ser-50 of this protein as the known site of p-casein phosphorylation. 

Subjecting the same p-casein digest to the phosphopeptide isolation procedure 
reduced sample complexity considerably, yielding only one significant doubly charged 
peptide ion at m/z=l 182.5 (Fig. 4D). The CID spectrum of this peptide showed a clear 
fragment ion series and a major signal at m/z=\ 133.6 due to the loss of H 3 P0 4 (Fig. 4D). 
Database searching of this spectrum identified the same peptide as that in Fig. 4B. The 
increase in apparent mass for the same peptide (compare Fig. 4A with 4B, and 4C with 4D) is 
due to quantitative modification on all seven carboxylates (aspartic acid, glutamic acid, and 
the C terminus) with ethanolamine during the isolation procedure. 

Example 4 

Sample recovery efficiency for the steps of the invention was examined using a 
phosphotyrosine-containing peptide, the myelin-basic protein (MBP). MBP was 
phosphorylated in vitro using the catalytic domain of the tyrosine kinase Lck and 
radiolabeled ATP (at a known specific activity). The phosphorylated peptide was digested 
with trypsin and 5 pmol of phosphopep tides were isolated as before, except that the 
carboxylate groups were blocked with d4-ethanol-amine. 

The ion chromatogram for the isolated phosphopeptides is illustrat£d<ffFig. 5 A where 
the most prominent ion at m/2 = 630. 1 (2+) was chosen for fragmentation. The CID of this 
ion is shown in Figure 5B. This unambiguously icktfjtifi^cf the phosphopeptide as 
THY*GSLPQK of M.P.(Aebersold, R. et^l-ff^91), "Determination of the site of tyrosine 
phosphorylation a the low picoipetelevel by automated solid-phase sequence analysis," Anal. 
Biochem. 199:51-60} > ^Pfiosphopeptide recovery efficiency over the entire six-step procedure 
was assesseji-bSTmeasuring radioactive counts recovered after each step of the procedure, 
witlrannal yield that was consistently about -20% of the starting material. 
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In vitro kinase reaction 

17 |ag of a baculovirally expressed Lck kinase domain-GST (glutathione 5- 
transferase) fusion protein, 20 jig MBP and 10 |iCi of 32 P-containing ATP were incubated at 
30°C for 1 hour in 40 pi of buffer containing 25 mM Tris (pH 7.5), 10 mM MnCl 2 , 0.25 mM 
ATP. After 1 hour, 6 M urea was added to stop the reaction. Reduction and alkylation were 
performed by the addition of dithiothreitol (DTT) at 10 mM for 30 minutes, followed by 2 
hours of incubation at 50 mM iodoacetamide. Samples were diluted with water to 1 ml, and 1 
|j.g of trypsin (Promega, Madison, WI) was added for 4 hours at 37°C. Peptides were then 
desalted on a CI 8 column (Waters, MA, cat. no. WAT023590) and recovered by elution with 
80% acetonitrile/0.1% TFA. Recovered radiolabeled peptides were quantified via Cerenkov 
counting. From this, an estimated 5 pmol of phosphorylated peptides were taken for isolation 
of phosphopeptides and evaluation of recovery efficiencies. Deuterated d4-ethanolamine 
(Isotec, Miamisburg, OH) was used to block carboxylate groups in this experiment. 



Example 5 : Phosphoprotein profiling in yeast 

Yeast S. cerevisiae strain (BWG1-7A) was grown till mid-log phase op^fTO media 
with 2% glucose as a carbon source and harvested by centrifugation. Rcetein extract was 
prepared by the glass beads method as described in Current Ppefocols in Molecular Biology 
(New York, J. Wiley). A mixture of DNAse 1 (20 U/pal)1ind RNAse (10 ng/ml) was added 
for 30 minutes on ice. Protein concentration^e^aetermined using Biorad protein assay and 
500 \ig of the protein extract was th^p^denatured in 0.1 M potassium phosphate buffer (pH 
8.0) with 6 M urea. Proteins^v^re reduced and alkylated by addition of DTT (lOmM, 30 
min)followed by 2 hpufs of incubation with iodoacetamide. Samples were then dialyzed 
prior to diggstftJn overnight with trypsin at 37 °C. The resulting peptide mixture was desalted 
bymrase phase CI 8 column as described above. Samples were treated as in Example 1. 

Phosphopeptides were isolated by this methoia»d^fiaIyzed by LC-MS/MS, with CID 
spectra being recorded and searclied-^gainstthe yeast sequence database. Fig. 6A shows the 
total ion inten§ii^H^or3ed with respect to retention time on the column, indicating the 
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complexity of the sample. Fig. 6B shows the m/z values observed integrated over the tiipe 
window indicated in Fig. 6 A. The major peptide peaks observed which also displaced loss of 
98 Da during CID are labeled with an asterisk (*), confirming that the m^rnty of the 
peptides detected were indeed phosphorylated. Further more, the selectivity of the method 
was apparent by virtue of the fact that over 80% of the Cip^pectra that led to identification 
were derived from phosphopeptides. Additionally^jSID spectra derived from the few non- 
phosphorylated peptides identified generallyfesulted from lower intensity precursors ions. 
Thus even with a highly complex stating material, only low level of non-specific peptide 
background carried throughJh€isolation procedure to the MS, affirming its selectivity. In the 
example shown, the iprfat m/z=l 032 J in Fig. 6B was selected for CID, this spectrum being 
shown in Fig^fiC In addition to observing a clear fragment ion series, a major signal 
correspetfiaing to the doubly charged parent ion after undergoing the loss of H 3 P0 4 is 
apparent at ra/z=983.8. 



Following database searching, the peptide was identified as being frompntJlase and 
having the indicated sequence. This peptide contained three potential^hfgonine 
phosphorylation sites, and the parent ion mass indicated th^t-thepeptide contained a single 
phosphate group. The y-7 to y-13 ions confinnedjhafuie phosphate was not on the N- 
terminal threonine. Two possible pairsaf^ftey-5 and y-6 ions correspond to either one of the 
other two threonine residues hekl^phosphorylated. Thus the exact site of phosphorylation for 
this peptide could n9^b€aetermined. Further, the possibility that a mixture of both possible 
mono-phospbdrylated species of this peptide may have co-eluted from the LC column can not 
be^tcluded. 



Table 1 lists the proteins (and gene names) identified following database searching of 
the additional CID spectra obtained in the same manner, along with the sequences of the 
phosphopeptides determined. All peptides positively identified were singly phosphorylated 
species and they were phosphorylated at serine or threonine residues. Table 1 also indicates 
the location of the phosphorylation site within the peptide when this could be unequivocally 
determined, or gives the possible phosphorylation sites in cases in which the observed CID 
data could not distinguish between two or more phosphorylation sites. As was seen in Fig. 
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6C, such occasions did not interfere with the identification of the phosphopeptide, and the site 
of phosphorylation could typically be confined to a cluster of hydroxyl amino acids. 

No peptides with multiple serine or threonine phosphorylation sites were identified. In 
many cases, ions corresponding to loss of H 3 P0 4 dominated the fragmentation process, 
resulting in insufficient fragmentation at peptide bonds for sequencing. This effecptvcJuld be 
compounded by multiple phosphoserine- or phosphothreonine sites in agkrgle peptide. 
Peptides of too large or small sizes are generally not suitable fop-MS sequencing; 
additionally, such peptides could be lost during the dgsaiting steps in this method. Whether 
this method could completely determine alljfe^phosphorylation sites of a given protein 
therefore depends on whether phosphorylation sites are contained in peptides of suitable 
sizes/hydrophobicities fopMS analysis, a limitation common to all MS-based methods. In 
such cases, alternative proteolytic enzymes can be considered. In the experiment on yeast 
lysate^tyrtSsine phosphorylated peptides were not identified, likely due to its significantly 
itfwer abundance. 



Most of the proteins identified were found to be glycolytic enzymes, including 
enolase, glyceraldehyde 3-phosphate dehydrogenase, phosphoglycerate kinase and pyruvate 
kinase. Since the cells from which the proteins were isolated utilized glucose as a carbon 
source, the identification of phosphorylation sites on glycolytic enzymes as being the major 
species present in this sample is perhaps not surprising. Phosphopeptides from other highly 
expressed proteins, such as pyruvate decarboxylase isozyme 1 , ribosomal proteins and heat 
shock proteins were also identified. Interestingly, most of the proteins listed in Table 1 were 
not annotated in the database as being known phosphoproteins. However, previous studies 
from our group and others identified many of the proteins listed in Table 1 in multiple 2D gel 
spots (Gygi, S.P. et al. (1999), "Correlation between protein and mRNA abundance in yeast," 
Mol Cell Biol. 19, 1720-1730; Futcher B. et al. (1999) Mol Cell Biol 19, 7357-68), 
consistent with there being differentially phosphorylated forms of the same protein. These 2D 
gel data are thus consistent with the identifications made here, supporting the assertion that 
these proteins are indeed phosphorylated in vivo. Although phosphopeptides from lower 
abundant, regulatory proteins were not identified in this experiment, the method itself 
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nonetheless is compatible with larger scale sample preparation, or for analysis of enriched 
protein complexes of interest. Subsequent fractionation either prior to or following this 
isolation should greatly facilitate identification of lower abundance proteins. 

LC-MS/MS and database analysis 

An LCQ ion trap mass spectrometer (Finnigan MAT, CA) was used with a HP1 100 
solvent delivery system (Agilent, CA). Peptides were pressure-loaded onto the column, then 
eluted and analyzed by microcapillary LC-MS/MS as described previously (Gygi, S.P. et al. 
(1999), "Quantitative analysis of complex protein mixtures using isotope-coded affinity tags," 
Nat. Biotechnol. 17, 994-999). The collision energy for the LCQ was set at 30%. SEQUEST 
(Eng, J. et al. (1994), "An approach to correlate tandem mass spectral data of peptides with 
amino acid sequences in a protein database," J. Am. Soc. Mass Spectrom. 5, 976-989) 

was used for searching yeast YPD database for peptide sequence and identification of site(s) 
of phosphorylation. Search parameters included differential mass modification to serine, 
threonine and tyrosine due to phosphorylation, static mass modification to aspartate, 
glutamate and C-terminus due to the ethanolamine tag, and static mass modification to 
cysteine due to alkylation by iodoacetamide. Differential mass modification means both 
possibilities of modified or unmodified amino acid residues were used in database search, 
while static mass modification means only the modified amino acid residue was used. 

Using procedures similar to those outlined above, phosphopeptides were isolated and 
identified from Jurkat cells. A list of phosphopeptides identified in these experiments is 
provided in Table 2. 

Those of ordinary skill in the art will appreciate that protective groups, labels, 
reagents, solid phase materials, acid treatments (mild or strong), isotopic labels, purification 
and washing procedures other than those specifically disclosed herein can be employed to 
practice the method of this invention. A variety of functional equivalent reagents, methods 
and techniques in addition to those specifically disclosed herein are known in the art and can 
be readily employed or adapted without expense of undue experimentation to the practice of 
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this invention. All art-known functional equivalents and known variants of the materials and 
methods specifically used herein are intended to be encompassed by this invention. 

All references cited herein are incorporated by reference herein to the extent that they 
are not inconsistent herewith. 
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Table 1. Phosphopeptide profile from yeast grown with glucose as a carbon source. 

Gene name a Protein name Phosphopeptide identified b 



EN01_yeast & 
EN02_yeast 


Enolase 


TAGIQIVADDLT*VT*NPAR c 
IGLDCAS*S*EFFK c 
SGET*EDT*FIADLVVGLR c 


G3Pl_yeast 


Glyceraldehyde 3- 

phosphate 

dehydrogenase 


LVSWYDNEYGYS*T*R c 

VIS *NASCTTNCLAPL AK 
VISNASCT*T*NCLAPLAK c 
TASGNIIPSST*GAAK 


DCPl_yeast 


Pyruvate decarboxylase 
isozyme 1 


NPVILADACCS*R 

TP AN AA VP AS *T*PLK c 


KPYl_yeast 


Pyruvate kinase 1 


G VNLPGTD VDLP ALS * EK 
GVNLPGT*DVDLPALSEK 


PGK_yeast 


Phosphoglycerate 
kinase 


DVT*FLNDCVGPEVEAAVK 

VLENT*EIGDSIFDK 

EGIPAGWQGLDNGPES*R 

ASAPGS*VILLENLR 

bLr Cj V Ar L b * JiK 


PGMl_yeast 


Phosphoglycerate 
mutase 1 


SFDVPPPPIDASSPFS*QK 
VYPDVLYT*S*K c 


ALF_yeast 


Aldolase 


F AIP AINVT *S*S*S*T* A V AALE AAR c 


G6PI_yeast 
HS75_yeast 


Glucose-6-phosphate 
lsomerase 

Heat shock protein 


EANVT*GLR 
SQIDEVVLVGGS*T*R c 


HS72_yeast 


Heat shock protein 


TTPSFVGFTDT*ER 


RLll_yeast 


60s ribosomal protein 


VLEQLSGQTPVQS*K 


R141_yeast 


40s ribosomal protein 


IEDVTPVPS*DS*T*R c 



38 



a Gene names are according to Swiss-Prot nomenclature ( www.expasv.ch ). 

b Sequence and site of phosphorylation was identified by SEQUEST 18 (see text). 

c Multiple marks of asterisk indicates ambiguity on the exact site of phosphorylation. All 
peptides are singly-phosphorylated. 

♦Indicating site of phosphorylation at the serine or threonine residue to its left. 
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Table 2: 



Results on phosphoproteins in Jurkat Cells 



Human GAP SH3 binding protein 

Tumor necrosis factor type 1 receptor 
associated protein 

Thyroid hormone receptor-associated 
protein complex component TRAP 150 
mRNA 

Stathmin (phosphoprotein pi 9) 
Alpha enolase 

Glyceraldehyde 3 -phosphate dehydrogenase 
Pyruvate kinase 

L-lactate dehydrogenase h chain 



SSSPAPADIAQTVQEDLR 
GVVDSEDIPLNLSR 

ASAVSELSPR 

ASGQAFELILSPR 

AAVPSGASTGIYEALELR 

VPTANVSVVDLTCR 

NTGIICTIGPASR 

VIGSGCNLDSAR 
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